Rogel et al. BMC Genomics 2014, 15:575 
http://www.biomedcentral.com/1471 -21 64/1 5/575 



Genomics 



RESEARCH ARTICLE Open Access 



Genomic basis of symbiovar mimosae in 
Rhizobium etli 

Marco A Rogel 1 , Patricia Bustos 2 , Rosa I Santamaria 2 , Victor Gonzalez 2 , David Romero 3 , Miguel Angel Cevallos 2 , 
Luis Lozano 2 , Jaime Castro-Mondragon 1 , Julio Martinez-Romero 1 , Ernesto Ormeho-Orrillo 1 
and Esperanza Martinez-Romero 1 * 



Abstract 

Background: Symbiosis genes {nod and nif) involved in nodulation and nitrogen fixation in legumes are plasmid-borne 
in Rhizobium. Rhizobial symbiotic variants (symbiovars) with distinct host specificity would depend on the type of 
symbiosis plasmid. In Rhizobium etli or in Rhizobium phaseoli, symbiovar phaseoli strains have the capacity to form 
nodules in Phaseolus vulgaris while symbiovar mimosae confers a broad host range including different mimosa trees. 

Results: We report on the genome of R. etli symbiovar mimosae strain Mim1 and its comparison to that from R. etli 
symbiovar phaseoli strain CFN42. Differences were found in plasmids especially in the symbiosis plasmid, not only in 
nod gene sequences but in nod gene content. Differences in Nod factors deduced from the presence of nod genes, 
in secretion systems or ACC-deaminase could help explain the distinct host specificity. Genes involved in P. vulgaris 
exudate uptake were not found in symbiovar mimosae but hup genes (involved in hydrogen uptake) were found. 
Plasmid pRetGFN42a was partially contained in Mim1 and a plasmid (pRetMimlc) was found only in Mim1. Chromids 
were well conserved. 

Conclusions: The genomic differences between the two symbiovars, mimosae and phaseoli may explain different host 
specificity. With the genomic analysis presented, the term symbiovar is validated. Furthermore, our data support that 
the generalist symbiovar mimosae may be older than the specialist symbiovar phaseoli. 
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Background 

Bacterial nitrogen fixation in legume nodules contributes 
to plant nutrition and allows plants to grow in nitrogen 
deficient soils. Genes for plant nodulation and nitrogen 
fixation are plasmid-borne in Rhizobium spp. (reviewed 
in [1]) and symbiovars define host specificity. There are 
over twenty different symbiovars reported not only in 
Rhizobium but also in Bradyrhizobium and in other gen- 
era of nodule forming bacteria [2-7]. The term symbio- 
var was proposed as a counterpart to the term pathovar 
in pathogenic bacteria [2], A theoretical model proposes 
that a single species may exhibit alternative symbiovars de- 
pending on the presence of symbiotic plasmids or symbi- 
otic islands [2], The same symbiovar may be present in 
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distinct species as a consequence of the lateral transfer 
of symbiotic plasmids or islands. Symbiotic genes and 
other genes associated with niche adaptation may have 
evolutionary histories independent of the evolution of 
the chromosomal genes [8]. Two symbiovars are recog- 
nized in Rhizobium etli: phaseoli (conferring the ability 
to nodulate Phaseolus vulgaris) and mimosae (involved 
in nodulating mimosas and P. vulgaris, [9]). Symbiovar 
mimosae strains were isolated from Mimosa affinis in 
Morelos and have a broad host range, including plants 
of M. affinis, Leucaena leucocephala, Calliandra grandi- 
flora, Acaciella angustissima as well as P. vulgaris [9]. 
Symbiovar mimosae was originally distinguished from 
sv. phaseoli by the sequence of a few symbiotic genes 
and by the organization of m/and common nod genes. 
Multiple copies of nifli genes and a nodA gene sepa- 
rated from nodBC found in sv. phaseoli and not in sv. 
mimosae served as a molecular basis to distinguish these 
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symbiovars [10,11]. Phytogenies indicated that symbiovar 
mimosae and phaseoli nifli genes were related [9] and 
similar to the corresponding gene in sv. gallicum [12]. Dif- 
ferent origins of replication were found in sv. phaseoli and 
sv. mimosae symbiotic plasmids and both symbiotic plas- 
mids were compatible [9]. It has been proposed that sv. 
mimosae is older than sv. phaseoli and that the phaseoli 
symbiotic plasmid was selected by P. vulgaris [13]. P. vul- 
garis is a recent species (probably two million years old 
[14]), while mimosas seem to be older. 

Mimosas are distributed worldwide with Brazil and 
Mexico as main diversification sites. Mimosas in South 
America are nodulated by p-Proteobacteria like Burkhol- 
deria or other (3-Proteobacteria [15-19] while mimosas 
in Mexico are only exceptionally nodulated by Burkhol- 
deria (unpublished). Mimosas from Mexico and Brazil 
are phylogentically separated [20]. Additionally abiotic 
conditions like pH and soil nitrogen content may ac- 
count for their differences in symbionts [21]. Native mi- 
mosas in India are nodulated by sinorhizobia [22], that 
we have also found in some mimosa nodules in Mexico 
(unpublished). 

Based on multilocus enzyme electrophoresis analysis, 
M. affinis isolate Miml was found to group closely to R. 
etli sv. phaseoli CFN 42 [9]. Other M. affinis isolates 
such as Mim2 were separated from CFN42 (Figure two 
in [9]) and thus Mim2 has been recently reassigned to 
Rhizobium phaseoli [1,23]. Therefore we recognize now 
that symbiovar mimosae exists in R. phaseoli as well as 
in R. etli. It is the aim of this work to define the genomic 
differences between two R. etli strains (CFN42 and 
Miml) representing the symbiovars phaseoli and mi- 
mosa, respectively. 

Methods 

Strains and growth conditions 

Rhizobium strains were grown overnight at 28°C in PY 
medium [24] after recovery from glycerol stocks at -80°C. 
Bacterial strains to be inoculated on plants were grown on 
solid PY media and resuspended in water to an OD 600 of 
0.5. For PCR or DNA isolation, bacteria were grown in li- 
quid PY cultures [24]. 

Plasmid profiles 

Plasmid profiles were visualized on agarose gels according 
to the protocol described by Hynes and McGregor [25]. 
Plasmid patterns from R. etli CFN42 or R. leucaenae 
CFN299 were used as references. 

Plant nodulation assays 

L. leucocephala seeds were treated with concentrated 
sulfuric acid for 15 min, rinsed with water and surface 
disinfected with sodium hypochlorite as described [26]; 
the same procedure was used to disinfect P. vulgaris 



seeds. Seeds were germinated in water- agar plates in the 
dark and transferred to flasks after 3 days. L. leucoce- 
phala plants were grown in vermiculite flasks with N 
free Fahraeus nutrient solution for 40 days and P. vul- 
garis plants in agar flasks with the same nutrient solu- 
tion for 14 days. 

Genome sequencing, assembly and annotation 

Genomic DNA from R. etli Miml was sheared to pro- 
duce two paired-end libraries for 454 pyrosequencing, 
one with 3 Kb inserts and the other with 8 Kb inserts. 
An additional 3 Kb library was sequenced only at one 
end. The total amount of reads were 512,236 paired 
reads and 112, 079 single reads. Library construction 
and sequencing was done at Mogene LC (St. Louis, MO, 
USA). Additionally two paired-end libraries were con- 
structed, one with 200 bp inserts and the second with 2 
Kb inserts. Both libraries were sequenced by Alumina at 
BGI (Beijing, China). A total amount of 16, 000, 000 
short-paired readings (50-60 bases) were assembled. 
To improve the scaffolding a BAC library was constructed 
in plndogo BAC- 5™ vector by BIOS&T (Quebec, Canada) 
using fragments from a partial genomic DNA Hindlll 
digestion. 105 BAC-ends were sequenced with ABI3730xl 
sequencer by Sanger method. Additionally, three BACs 
were completely sequenced with the same technology at 
the Center for Genomic Sciences (Cuernavaca, Mexico). 
Two of these BACs were selected by hybridizing with nifH 
and other pSym probes; they embraced half of the pSym 
plasmid sequence. The third BAC was from the 
chromosome. Final assembly of the symbiotic plasmid 
was obtained using sequence reads from the three 
sources of information: BAC sequences, Alumina, and 
454 pyrosequencing. 

Different assembly strategies were used with the follow- 
ing programs: Newbler 2.5.3 (ROCHE), Velvet 1.1.06 [27], 
Sspace-Basic 2.0 [28], and Consed v23 [29]. ORFs were 
predicted with Glimmer 3 [30], and annotations were 
done in Artemis 12.0 graphic display [31] using previous 
annotations made for R. etli CFN42 [32] and comparing 
with the non-redundant data base of the Genbank [33], 
Interpro database [34], and IS database (http://www-is. 
biotoul.fr). 

Sequence analysis 

Average nucleotide identity (ANI) was calculated using the 
JSpecies software [35]. The DNA conservation between two 
genomes or replicons was estimated by obtaining an align- 
ment with NUCmer [36], run with default parameters, and 
dividing the summed lengths of all aligned regions by the 
length of the genome or replicon and expressing the value 
obtained as a percentage. Common and specific protein 
families between R. etli CFN42 and Miml were detected 
using MCL as described [37]. 
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Genomic islands 

Alien Hunter vl.7 software [38] was used to analyze the 
chromosome sequence of R. etli sv. mimosae Miml and 
R. etli CFN42. The minimum region length for HT 
detection was 5 kbs. The score thresholds were 12.92 
and 14.96 for Miml and CFN42, respectively. 

Phylogenetic analysis 

Alignments were performed with MUSCLE [39] and 
manually verified. Maximum likelihood trees were gene- 
rated with PhyML [40] with tree node support evaluated by 
bootstrap analysis based on 1000 pseudoreplicate datasets. 
Phylogenetic relationships were also assessed by Bayesian 
inference using MrBayes 3.1 [41]. Analyses were initiated 
with random starting trees, run for 2,000,000 generations 
and three separate analyses were executed. Markov chains 
were sampled every 100 generations. We discarded 25% of 
trees as "burn in". 

Genome accession numbers 

Sequences and annotations were deposited in the 
Genbank database under accession numbers CP005950 
(chromosome), CP005951 (pRetMimla), CP005952 (pRe- 
tMimlb), CP005953 (pRetMimlc), CP005954 (pRetMimld), 
CP005955 (pRetMimle) and CP005956 (pRetMimlf). 

Results and discussion 

Genome of R. etli symbiovar mimosae strain Miml 

The final assembly of the R. etli sv. mimosae Miml genome 
rendered seven circular molecules: one chromosome and 
six plasmids at 150x coverage on average. The chromo- 
some was 4.8 Mb in size while the plasmids ranged in size 
from 181 kb to 1.08 Mb (Figure 1). Average Nucleotide 



Identity (AM) and the percentage of conserved DNA 
between Miml and CFN42 were 98.6% and 82.4% respec- 
tively on a whole genome analysis, confirming that both 
strains belong to the same species. Lower AM (less than 
90%) was found between Miml and strains of other species 
such as R. phaseoli and R. leguminosarum. 

There were more than twice as many unique genes in 
Miml than in CFN42, mainly in plasmids. The respective 
chromosomes of each strain had around 260 unique genes. 
In chromosomes, 35 genomic islands were identified only in 
Miml and 17 only in CFN42. (Figure 2); genes found in 
Miml genomic islands are shown in Additional file 1: Table 
SI. Examples of unique genes found in Miml and not in 
CFN42 are those encoding cytochrome oxidases, some 
chaperonins, dipeptide transporters, lactate dehydrogenase, 
a PHB depolymerase, a ferritin-like protein, exopolysacchar- 
ide biosynthesis genes, a type VI secretion system and mena- 
quinone biosynthesis as well as many hypothetical genes. 

The conserved genome in R. etli strains Miml and 
CFN42 includes not only the chromosome but two 
extrachromosomal replicons (pRetMimla-pRetCFN42b 
and pRetMimld-pRetCFN42e) that have been desig- 
nated as chromids in CFN42 [42] and one plasmid (pRe- 
tMimlb-pRetCFN42c) (Figure 3). Each of the chromids 
had less than 20 unique genes and the chromid pairs 
had an AM around 99% (Table 1). The small replicons 
pReCFN42a and pRetMimlc were partially conserved 
and large genomic differences were found in the sym- 
biotic plasmids (Figures 1 and 4, Table 1). 

Miml and CFN42 chromosomes were syntenic, as were the 
chromid pairs pRetMimla-pRetCFN42b and pRetMimld- 
pRetCFN42e, and plasmids pRetMimlb-pRetCFN42c. In 
comparison to R leguminosarum 3841, Miml plasmid 
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Figure 1 Schematic representation of plasmid patterns of Rhizobium etli symbiovars phaseoli and mimosae and R. leguminosarum. 

Equivalent replicons are indicated with the same color, "indicates the symbiotic plasmid. C indicates chromids. 
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Table 1 Average nucleotide identity (ANI) and conservation in percent between R. etli sv. mimosae Miml and R. etli sv. 
phaseoli CFN 42 or R. leguminosarum 3841 replicons 



Miml replicons 



ANI*/conservation § to the corresponding replicons in 
R. etli CFN42 R. leguminosarum 3841 



pRetMimlf 


97.4/51.1 (pReCFN42f) 


86.8/31.4 (pRLI 2) 


pRetMimle (pSym) 


89.2/8.8 (ReCFN42d) 




pRetMimld 


98.9/97.5 (pReCFN42e) 


88.2/61.2 (pRLII) 


pRetMimle 


86.4/1 1 .4 (pReCFN42a) 




pRetMimlb 


99.1/92.5 (pReCFN42c) 


87.2/55.7 (pRLIO) 


pRetMimla 


99.2/99.9 (pReCFN42b) 


87.6/58.4 (pRL9) 


RetMimlCh 


99.2/97.8 (ReCFN42Ch) 


88.1/78.1 (RLChr) 



* Average nucleotide identity was calculated using all portions of the replicons that could be aligned with the nucmer program. These regions included both genie 
and intergenic regions. 

Percentage of the Miml replicon involved in the ANI calculation is expressed here as the conservation value. 



equivalences were similar to those reported previously for R 
etli CFN42 [8]. ANI values were lower with R leguminosarum 
3841 that among the R etli strains (Table 1). 

All extrachromosomal replicons in Miml belong to the 
rep ABC plasmid family [43]. The protein products of the 
repABC operons of the replicon homologous pairs (pRe- 
tMiml a-pRetCFN42b, pRetMimld-pRetCFN42e and 
pRetMimlb-pRetCFN42c) were almost identical with a 
sequence identity greater than 97.51%, strongly sugges- 
ting that members of each replicon pair belong to the 
same incompatibility group. 



pRetCFN42f-pRetMim1f comparison 

The largest extrachromosomal replicon in Miml (pRe- 
tMimlf) was only partially conserved in the putative 
chromid pRetCFN42f (Figures 1 and 4). pRetMimlf and 
pRetCFN42f possess two repABC operons: repABCl and 
repABC2, The sequence identity between the two repABC 
operons in Miml is low. The degree of sequence identities 
between the corresponding repABC genes of pRetCFN42f 
and pRetMimlf is large enough to suggest that both plas- 
mids share the same incompatibility group and evolutio- 
nary origin. However only 51% of the pRetMimlf replicon 




B 






Figure 4 Graphic representation of the alignments of CFN42 (top)-Miml (bottom) replicons. A) pRetMiml a-pRetCFN42b, B) pRetMiml b- 
pRetCFN42c, C) pRetMiml d-pRetCFN42e, D) pRetMiml e-pRetCFN42d, E) pRetMiml f-pRetCFN42f. ORFs of each replicon are depicted with 
light blue arrows in their corresponding reading frame. Syntenic segments oriented in the same or opposite direction are joined by red and blue 
regions, respectively. 
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is conserved in CFN42 while 86% of pRetCFN42f was 
found in pRetMimlf (Figures 1 and 4). 

Type VI secretion system genes were only found in the 
megaplasmid pRetMimlf and the conserved impB compo- 
nent is phylogenetically related to the corresponding genes 
in Rhizobium sp. KIM5 (corresponding to PEL1 lineage, 
[44]) and R. leguminosarum strain 3841. Miml pRetMimlf 
has a duplicated citrate synthase gene as in sv. tropici sym- 
biotic plasmids. The plasmid duplicated citrate synthase 
gene in sv. tropici is required for eliciting a normal number 
of nodules and is regulated differently than the copy in the 
chromosome [45,46]. A plasmid borne citrate synthase 
was not found in CFN42. Phylogenetic analysis of citrate 
synthase genes showed that the Miml chromosomal 
gene product is identical to that of CFN42, while the gene 
in pRetMimlf has a novel sequence only distantly related 
to a plasmid copy of R. gallicum R602 (55% identity). 

Genes such as rail and raiR were found in the conserved 
region of pRetCFN42f and pRetMimlf. Rail produces 
homoserine lactones and RaiR is the transcriptional regula- 
tor. The rai system in R. phaseoli sv. phaseoli CNPAF 512 
affects nodule number but not nitrogen fixation in P. 
vulgaris [47]. This system also controls growth in R. 
phaseoli [48] and if this is the case in R. etli, this would 
explain its conservation in both symbiovars. 

A transcriptomic study compared the genome expres- 
sion of R. phaseoli Ch24-10 in maize and P. vulgaris 
rhizospheres [49]. Over 50% of the extrachromosomal 
genes highly expressed in P. vulgaris but not in maize 
roots were found in a Ch24-10 replicon equivalent to 
pRetCFN42f. It seems that genes in this replicon are 
involved in plant specific interactions. 

Symbiosis plasmid gene comparison 

Large differences were observed between the symbiotic 
plasmids of CFN42 and Miml (Figure 4). Around 10% and 
15% of the symbiotic plasmids of Miml and CFN42 had 
conserved syntenic regions with an average nucleotide 
identity of 89.2% (Table 1). 

Differences in symbiosis genes in CFN42 and Miml 
genomes are shown in Table 2. The Nod factor from R etli 
sv. phaseoli strain CFN42 is a pentamer of Af-acetylglucosa- 
mine with an acetyl fucose at the reducing end and methyl 
and carbamoyl groups at the non reducing end [50]. The 
heterologous expression in Azorhizobium caulinodans of Nod 
modification genes showed that fucosylated Nod factors were 
the most suitable to induce P. vulgaris nodulation [51]. In 
symbiovar mimosae no genes related to Nod factor fucosyla- 
tion (nodZ) were observed (Table 2), in their place, nodHPQ 
genes that modify the Nod factor with sulfate, were found. 
Such genes are present in sv. tropici strains that are also Leu- 
caena symbionts [10]. Like R. etli strain CFN42, Miml may 
produce nodulation factors bearing carbamoyl groups at their 
non reducing end residues but the position of these 



Table 2 Relevant symbiotic plasmid differences between 
R. etli sv. mimosae Miml and R. etli sv. phaseoli CFN 42 



Gene* symbiovar 





mimosae 


phaseoli 


nodHPQ 


+ 


- 


nodZ 


- 


+ 


nolL 




+ 


nodO 


+ 




nolO 




+ 


nodil 


+ 




fixKL 


+ 




fxkR 


+ 




acdS 


+ 




teu 




+ 


hup-hyp 


+ 





^Functions of each gene are explained in the text. 



decorations must differ because their pSyms encoded distinct 
carbamoyl transferases, NolO in CFN42 and NodU in Miml. 
Carbamoylation at the C-6 position introduced by NodU 
maybe promotes Leucaena nodulation [52,53]. Both Miml 
and CFN42 symbiotic plasmids carry nodS involved in methy- 
lation at the non reducing end, a decoration that is essential 
for bean and Leucaena nodulation in R. tropici CIAT 899 
[52]. A nodO homologue, 70% identical to that of Rhizobium 
sp. BR816, was found only in the sv. mimosae pSym. It has 
been shown that heterologous expression of nodO can 
improve nodulation of L. leucocephala by different rhizobia 
and can even extend the host range [54,55]. 

Miml nod gene phylogenies are congruent, resembling 
the corresponding genes from sv. giardinii (not shown) 
while Miml nijH genes resemble those from sv. phaseoli. 
Different NodDs in phaseoli and mimosae symbiovars may 
reflect their affinities for the different flavonoids exuded by 
the different host plants. Miml nodH gene (encoding the 
sulfotransferase involved in the synthesis of the Nod factor) 
resembles the corresponding gene in Rhizobium sp. IE4771 
that represents a novel genomic lineage related to R. etli 
and R. phaseoli [1,44]. 

acdS gene coding ACC-deaminase was found in the sym- 
biotic plasmid of Miml but not in CFN42. ACC-deaminase 
decreases the amount of ACC that is a precursor of ethylene 
that may diminish nodule number. A heterologous ACC- 
deaminase in Rhizobium sp TAL 1145 enhanced nodulation 
in Leucaena [56]. 

In CFN42, the fixGHIS-fixNOQP genes required for bio- 
synthesis of the symbiotic terminal oxidase are present in 
the pSym and also in pRetCFN42f [57]. The regulatory 
genes fixK and fixL are adjacent to this reiteration in 
pRetCFN42f while a fixK pseudogene is found in the pSym 
[58]. In Miml, we found that the symbiotic terminal oxidase 
genes are also reiterated in pRetMimlf but, in contrast to 
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CFN42, the sv mimosae pSym carried complete fixK and 
fixL genes. The JixGHIS-fixNOQP-fixKL region shared by 
pSym and pRetMimlf plasmids in Miml is 95% identical 
while the reiterated regions in CFN42 are only 87% identi- 
cal. The recently described fxkR gene in pRetCFN42f [59], 
coding for a response regulator that acts in conjunction with 
FixL and FixK, is present in the sv. mimosae pSym. There is 
a reiteration of this gene in pRetMimlf as well. 

Genes involved in P. vulgaris exudate uptake {teu genes, 
[60]) are found in the symbiotic plasmids in symbiovar 
phaseoli strains R. etli CFN42 and R. phaseoli CIAT652 but 
they were not found in the Miml genome. Rhizobium mu- 
tants in teu genes had reduced nodulation competitiveness 
in P. vulgaris [60]. 

The symbiosis plasmid of Miml has genes for a type III 
secretion system (T3SS) that are more closely related to 
those found in Mesorhizobium and Sinorhizobium strains 
than to those encoded in the CFN42 pSym (not shown). 
This difference may contribute to the disparate host 
ranges displayed by sv. phaseoli and mimosae strains con- 
sidering the function of rhizobial T3SS in specificity [61]. 

A cluster of hup-hyp genes encoding the components of an 
uptake hydrogenase (Hup) was found in the pSym of Miml 
but not in CFN42. The Miml products showed high iden- 
tities (>70%) to their counterparts coded in R leguminosarum 
and R. tropici [10,62]. R. tropici CIAT 899 lacks several hup 
genes and displays a Hup minus phenotype. In Miml, all 
genes except hupE are present. HupE is an uptake transporter 
for nickel [63], a metal required for Hup function. Since 
another nickel transporter is encoded elsewhere in Miml 
symbiotic plasmid, the Hup system could be functional. 

Miml symbiotic plasmid has a rep ABC origin of repli- 
cation as well as a repC gene that are not phylogenetically 
related to the R. etli CFN42 corresponding genes. pRe- 
tMimle repC resembles those from R. gallicum and Rhizo- 
bium sp. sv. giardinii IE4771 (corresponding to PEL1 
lineage). Miml repABC from the pSym resembles the 
corresponding genes in R. endophyticum CCGE502 but an 
extensive plasmid conservation was not observed. CCGE502 
is Nod" and does not have a symbiotic plasmid [64]. 

The analysis of insertion sequences in R. etli CFN42 sug- 
gested that the symbiotic plasmid did not significantly share 
IS sequences with the rest of the genome [65]. This was 
interpreted as evidence that the pSym was a recent acquisi- 
tion in this bacterium. The analysis of IS sequences in Miml 
indicated that the symbiotic plasmid had the largest number 
of IS sequences, some of them shared with the chromosome, 
pRetMimlf and pRetMimlb, this may perhaps indicate that 
this symbiotic plasmid has an older history with the R. etli 
genomic background than the phaseoli plasmid. Miml has a 
large number of IS66 that are common in rhizobia. 

Genome similarities were found among the different 
Miml replicons. pRetMimle (pSym) has similar sequences 
to pRetMimle and the same is observed among pRetMimle 



(pSym) and pRetMimlf (Figure 5). The similarities of Miml 
symbiotic plasmid and the putative chromid pRetMimlf 
could support that symbiovar mimosae symbiotic plasmid 
is ancestral in R. etli. In contrast the phaseoli symbiotic 
plasmid was found dissimilar to the rest of the genomic 
background in R. etli, except to pReCFN42a [32,66] . 

In R. etli CFN42, the symbiotic plasmid and pRetCFN42a 
have common sequences such as tra and vir genes and 
repeated sequences that mediate the natural cointegration of 
both plasmids for the conjugative transfer of CFN42 pSym 
[67]. In contrast to pReCFN42a, pRetMimle from Miml does 
not seem to participate in the transfer of the Miml symbiotic 
plasmid, which we have been unable to transfer to other 
bacterial hosts (Marco A. Rogel, unpublished observations). 

Plant-interaction genes not in the symbiosis plasmid 

Some genes involved in plant interactions were found 
conserved in both symbiovars, such as those encoding 
Rmr extrusion pumps that may be involved in elimina- 
ting plant produced phytoalexins. R. etli mutants in 
these genes had reduced nodulation [68]. Those genes 
are encoded in chromid pRetCFN42b in CFN42 and in 
the corresponding replicon pRetMimla. rmr A gene had 
97% identity and rmrB and rmrR genes 98% in CFN42 
and Miml genomes. Homologous genes were found 
being expressed in different plant rhizospheres [1,49,69]. 

Even though sv. mimosae strains are capable of forming 
nodules in Leucaena, we did not find genes resembling 
mid or pyd genes involved in the catabolism of the toxic 
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amino acid derivative mimosine found in Leucaena plants. 
Such genes were described from a Rhizobium sp. strain 
TAL 1145 (related to R. tropici) that was isolated from 
Leucaena plants [70]. Mimosa pigra has a much lower 
level of mimosine that Leucaena plants [71] and data for 
other mimosa species is not available. 

Symbiovar phaseoli is prevalent in different Phaseolus 
vulgaris nodule bacteria 

R. etli sv. mimosae strains have a broader host range than 
symbiovar phaseoli strains. In particular, L. leucocephala 
plants served as a host to distinguish symbiovar mimosae 
strains. Thirty-six P. vulgaris nodulating bacteria (with R. 
etli-kke 16S rRNA gene sequences) obtained from the rain 
forest of Los Tuxtlas in Mexico corresponded to symbio- 
var phaseoli on the basis of nodA gene organization [10] 
and for being unable to nodulate Leucaena plants. We 
found that some strains that were previously classified as 
R. etli such as CIAT652, Ch24-10, CNPAF512, 8C-3 and 
Brasil5 now assigned to R. phaseoli [1], as well as others 
like Kim5, GR56 and CIAT 894 now assigned to recently 
named novel lineages [44] corresponded to sv. phaseoli 
when we analyzed their genomes. Symbiotic plasmids are 
highly conserved in symbiovar phaseoli [66] perhaps from 
being recently evolved [13]. Considering that the majority 
of P. vulgaris nodule isolates tested corresponded to sv. 
phaseoli we may conclude that this symbiovar is better 
adapted to its host, thus phaseoli seems to be a specialist 
symbiovar having a narrow range not including mimosa 
plants. The phaseoli symbiovar is found in several Rhizo- 
bium species or lineages (R. gallicum, R. giardinii, R, 
phaseoli, R. etli and Rhizobium sp. corresponding to PEL1 
lineage). The widespread of this symbiovar may be in rela- 
tion to its host historic worldwide distribution and to the 
transferability of the symbiotic plasmid, seemingly an 
epidemic plasmid. Besides having the phaseoli symbiovar, 
the Rhizobium species mentioned above have additional 
generalist symbiovars: gallicum or giardinii or mimosae 
Having alternative symbiovars with different host ranges 
may be advantageous in rhizobia, as it expands their 
legume niches and allows them to avoid the specialist- 
generalist dilemma. 

Conclusions 

The term symbiovar is validated with genomic analyses that 
show that a common genomic background may harbor 
different symbiotic plasmids determining host specificity. 
However, besides differences in the symbiotic plasmids 
there were differences in other ERs and in the chromo- 
somes in the two strains analyzed, CFN42 and Miml. In 
Miml, Nod factors with sulfate modifications, secretion 
systems or ACC-deaminase may help explain the 
extended host range of symbiovar mimosae. In CFN42, 
teu genes that participate in exudate uptake [60] and 



genes involved in Nod factor fucosylation (nodZ) may 
contribute to P. vulgaris host specialization. The discus- 
sion that mimosae is older than phaseoli may apply to 
gallicum and giardinii, thus we propose that gallicum and 
giardinii are older than phaseoli. 
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